Application of finite-state transducers to the acquisition of verb subcategorization information
نویسندگان
چکیده
This paper presents the design and implementation of a finite-state syntactic grammar of Basque that has been used with the objective of extracting information about verb subcategorization instances from newspaper texts. After a partial parser has built basic syntactic units such as noun phrases, prepositional phrases, and sentential complements, a finite-state parser performs syntactic disambiguation, determination of clause boundaries and filtering of the results, in order to obtain a verb occurrence together with its associated syntactic components, either complements or adjuncts. The set of occurrences for each verb is then filtered by statistical measures that distinguish arguments from adjuncts.
منابع مشابه
A Connectionist Model of Verb Subcategorization
Much of the debate on rule-based vs. connectionist models in language acquisition has focussed on the English past tense. This paper investigates a new area, the acquisition of verb subcategorization. Verbs differ in how they express their arguments or subcategorize for them. For example, “She gave him a book.” is good, but “She donated him a book.” sounds odd. The paper describes a connectioni...
متن کاملA Bootstrapping Approach to Parser Development
This paper presents a robust parsing system for unrestricted Basque texts. It analyzes a sentence in two stages: a unification-based parser builds basic syntactic units such as NPs, PPs, and sentential complements, while a finite-state parser performs syntactic disambiguation and filtering of the results. The system has been applied to the acquisition of verbal subcategorization information, ob...
متن کاملBengali Verb Subcategorization Frame Acquisition - A Baseline Model
Acquisition of verb subcategorization frames is important as verbs generally take different types of relevant arguments associated with each phrase in a sentence in comparison to other parts of speech categories. This paper presents the acquisition of different subcategorization frames for a Bengali verb Kara (do). It generates compound verbs in Bengali when combined with various noun phrases. ...
متن کاملSemitic Morphological Analysis and Generation Using Finite State Transducers with Feature Structures
This paper presents an application of finite state transducers weighted with feature structure descriptions, following Amtrup (2003), to the morphology of the Semitic language Tigrinya. It is shown that feature-structure weights provide an efficient way of handling the templatic morphology that characterizes Semitic verb stems as well as the long-distance dependencies characterizing the complex...
متن کاملLearning Subcategorization
A method to identify the subcategorized constituents of a verb (its complements) automatically in a sentence is useful in various areas of Natural Language Processing (e.g. automatic acquisition of subcategorization lexicons, parsing, acquisition of verb semantics, information retrieval). I will describe a method for subcategorization identification that uses memorybased learning. Train and tes...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Natural Language Engineering
دوره 9 شماره
صفحات -
تاریخ انتشار 2003